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O ■ Abstract 

(N 

^ I Local algorithms on graphs are algorithms that run in parallel on the nodes of a graph 

MH' to compute some global structural feature of the graph. Such algorithms use only local 

information available at nodes to determine local aspects of the global structure, while also 

lO i potentially using some randomness. Recent research has shown that such algorithms show 

significant promise in computing structures like large independent sets in graphs locally. 

r/ . . Indeed the promise led to a conjecture by Hatami, Lovasz and Szegedy [HLSj that local 

Ph ! algorithms may be able to compute maximum independent sets in (sparse) random d- 

r~| ' regular graphs. In this paper we refute this conjecture and show that every independent 

set produced by local algorithms is multiplicative factor 1/2 + l/(2v2) smaller than the 

largest, asymptotically as d — >• c«. 

Our result is based on an important clustering phenomena predicted first in the litera- 
ture on spin glasses, and recently proved rigorously for a variety of constraint satisfaction 
problems on random graphs. Such properties suggest that the geometry of the solution 
space can be quite intricate. The specific clustering property, that we prove and apply in 
QQ ■ this paper shows that typically every two large independent sets in a random graph ei- 

ther have a significant intersection, or have a nearly empty intersection. As a result, large 
t;;j- I independent sets are clustered according to the proximity to each other. While the clus- 

^^ ' tering property was postulated earlier as an obstruction for the success of local algorithms, 

such as for example, the Belief Propagation algorithm, our result is the first one where the 
clustering property is used to formally prove limits on local algorithms. 



X 

^ ; 1 Introduction 

Local algorithms are decentralized algorithms that run in parallel on nodes in a network using 
only information available from local neighborhoods to compute some global function of data 
that is spread over the network. Local algorithms have been studied in the past in various com- 
munities. They arise as natural solution concepts in distributed computing (see, e.g., |Lin92] ). 
They also lead to efficient sub-linear algorithms — algorithms that run in time significantly less 
than the length of the input — and |PR,n7l INOOSi IKKNOOQi IRTVXllj illustrate some of the 
progress in this direction. Finally local algorithms have also been proposed as natural heuristics 
for solving hard optimization problems with the popular Belief Propagation algorithm (see for 
instance |WJ05t IMM09] ) being one such example. 
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In this work we study the performance of a natural class of local algorithms on random reg- 
ular graphs and show limits on the performance of these algorithms. The motivation for our 
work comes from the a notion of local algorithms that has appeared in a completely different 
mathematical context, namely that of the theory of graph limits, developed in several papers, 
including |BCL+n8j . [BCl7n2] . jLSn6] . |B(::LK] . jBncn8] . jEnn^ jHLS] . In the realms of this the- 
ory it was conjectured that every "reasonable" combinatorial optimization problem on random 
graphs can be solved by means of some local algorithms. To the best of our knowledge this 
conjecture for the first time was formally stated in Hatami, Lovasz and Szegedy in |HLS] . and 
thus, from now on, we will refer to it as Hatami-Lovasz-Szegedy (or HLS) conjecture, though 
informally it was posed by Szegedy earlier, and was referenced in several papers, including Lyons 
and Nazarov [LNllj . and Csoka and Lippner |CL12j . In a concrete context of the problem of 
finding largest independent sets in sparse random regular graphs, the conjecture is stated as 
follows. Let Td^r be a rooted (i-regular tree with depth r. Namely, every node including the root, 
has degree r, except for the leaves, and the distance from the root to every leaf is r. Consider 
a function fr : [0, l]^'*-'- — )• {0, 1} which maps every such tree whose nodes are decorated with 
real values from [0, 1] to a "decision" encoded by and 1. In light of the fact that in a random 
d- regular graph Gd(r) the typical node has depth-r neighborhood isomorphic to Td,r, for any 
constant r, such a function fr can be used to generate (random) subsets / of Grf(r) as follows: 
decorate nodes of Gd{r) using i.i.d. uniform random values from [0, 1] and apply function fr 
in every node. The set of nodes for which fr produces value 1 defines /, and is called "i.i.d. 
factor" . It is clear that fr essentially describes a local algorithm for producing sets I (sweeping 
issue of computability of fr under the rug). The HLS conjecture postulates the existence of a 
sequence oi fr,r = 1,2, .. ., such that the set / thus produced is an independent subset of Gd(r) 
and asymptotically achieves the largest possible value as r — )■ oo. Namely, largest independent 
subsets of random regular graphs are i.i.d. factors. The precise connection between this conjec- 
ture and the theory of graph limits is beyond the scope of this paper. Instead we refer the reader 
to the relevant papers [HLSj . jELlOj . The concept of i.i.d. factors appears also in one of the open 
problem by David Aldous |Ald] in the context of coding invariant processes on infinite trees. 

It turns out that an analogue for the HLS conjecture is indeed valid for another important 
combinatorial optimization problem - matching. Lyons and Nazarov [LNllj established it for the 
case of bi-partite locally T^ ,.-tree-like graphs, and Csoka and Lippner established this result for 
general locally Td,.-tree-like graphs. Further, one can modify the framework of i.i.d. factors by 
encapsulating non-T^^ ,. type neighborhoods, for example by making fr depend not only on the 
realization of random uniform in [0, 1] values, but also on the realization of the graph-theoretic 
neighborhoods around the nodes. Some probabilistic bound on a degree might be needed to make 
this definition rigorous (though we will not attempt this formalization in this paper). In this case 
one can consider, for example, i.i.d. factors when neighborhoods are distributed as r generations 
of a branching process with Poisson distribution, and then ask which combinatorial optimization 
problems defined now on sparse Erdos-Renyi graphs G(n, d/n) can be solved as i.i.d. factors. 
Here G(n, d/n) is a random graph on n nodes with each of the (g) edges selected with probability 
d/n, independently for all edges, and c? > is a fixed constant. In this case it is possible to show 
that when c < e, the maximum independent set problem on G{n, d/n) can be solved nearly 
optimally by the well known Belief Propagation (BP) algorithm with constantly many rounds. 
Since the BP is a local algorithm, then the maximum independent set on G(n, d/n) is an i.i.d. 
factor, in the extended framework defined above. (We should note that the original proof of Karp 



and Sipser |KS81] of the very similar result, relied on a different method.) Thus, the framework 
of local algorithms viewed as i.i.d. factors is rich enough to solve several interesting combinatorial 
optimization problems. 

Nevertheless, in this paper we refute the HLS conjecture in the context of maximum indepen- 
dent set problem on random regular graphs Gd(n). Specifically, we show that for large enough 
d, with high probability as n ^ oo, every independent set producible as an i.i.d. factor is a 
multiplicative factor 7 < 1 smaller than the largest independent subset of Grf(ra). We establish 
that 7 is asymptotically at most | + -j= (though we conjecture that the result holds simply for 
7 = 1/2, as we discuss in the body of the paper). 

Our result is based on a powerful, though fairly simple to establish in our case, so-called 
clustering or shattering property of some combinatorial optimization problem on random graphs, 
first conjectured in the theory of spin glasses and later confirmed by rigorous means. For the first 
time this clustering property was discussed in terms of so-called overlap structure of the solutions 
of the Sherrington-Kirkpatrick model |TallO] . Later, it featured in the context of random K-SAT 
problem and was proved rigorously by Achlioptas, Coja-Oghlan and Ricci-Tersenghi |ACORTllj . 
and by Mezard, Mora and Zecchina [MMZOSJ . independently. We do not define the random K- 
SAT problem here and instead refer the reader to the aforementioned papers. What these results 
state is that in certain regimes, the set of satisfying assignments, with high probability, can 
be clustered into groups such that two solutions within the same cluster agree on a certain 
minimum number of variables, while two solutions from different clusters have to disagree on 
a certain minimum number of variables. In particular, one can identify a certain non-empty 
interval [zi,Z2] C [0,1] such that no two solutions of the random K-SAT problem agree on 
precisely z fraction of variables for all z G [zi,Z2]- One can further show that the onset of 
clustering property occurs when the density of clauses to variables becomes at least 2^ /K, while 
at the same time the formula remains satisfiable with high probability, when the density is below 
2^ log 2. Interestingly, the known algorithms for finding solutions of random instances of K- 
SAT problem also stop working around the 2^ /K threshold. It was widely conjectured that 
the onset of the clustering phase is the main obstruction for finding such algorithms. In fact, 
Coja Oghlan [COll] showed that the BP algorithm, which was earlier conjectured to be a good 
contender for solving the random instances of K-SAT problems, also fails when the density of 
clauses to variables is at least 2^ log K/K, though Coja-Oghlan's approach does not explicitly 
rely on the clustering property, and one could argue that the connection between the clustering 
property and the failure of the BP algorithm is coincidental. 

Closer to the topic of this paper, the clustering property was also recently established for 
independent sets in Erdos-Renyi graphs. In particular Coja-Oghlan and Efthymiou [COEll] 
established the following result. It is known that the largest independent subset of G(n, d/n) 
has size approximately {2\og d/d)n, when d is large, see the next section for precise details. The 
authors of jCOEll] show that the set of independent sets of size at least approximately (log d/d)n 
(namely those within factor 1/2 of the optimal), are also clustered. Namely, one can split them 
into groups such that intersection of two independent sets within a group has a large cardinality, 
while intersection of two independent sets from different groups has a small cardinality. One 
should note that algorithms for producing large independent subsets of random graphs also stop 
short factor 1/2 of the optimal, both in the case of sparse and in the dense random graph 
cases, as exhibited by the well-known Karp's open problem regarding independent subsets of 
(G(n,l/2) [XS92] . 



This is almost the result we need for our analysis with two exceptions. First, we need to 
establish this clustering property for random regular as opposed Erdos-Renyi graphs. Second, 
the result in |COEllj apphes to typical independent sets and does not rule out the possibility 
that there are two independent sets with some "intermediate" intersection cardinality, though 
the number of such pairs is insignificant compared to the total number of independent sets. For 
our result we need to show that, without exception, every pair of "large" independent sets has 
either large or small intersection. We indeed establish this, but at the cost of loosing additional 
factor 1/(2a/2). In particular, we show that for large enough d, with high probability as n ^ cxd, 
every two independent subsets of Grf(n) with cardinality asymptotically (1 + (3) {log d/d)n, where 
1 > /3 > 1 + ^75 either have intersection size at least {1 + z) (log d/d)n or at most {l—z)(logd/d)n, 
for some z < p. The result is established using a straightforward first moment argument: 
we compute the expected number of pairs of independent sets with intersection lying in the 
interval [(1 — z)(logd/d)n, (1 + z)(logd/d)n], and show that this expectation converges to zero 
exponentially fast. 

With this result at hand, the refutation of the HLS conjecture is fairly simple to derive. 
We prove that if local algorithms can construct independent sets of size asymptotically (1 + 
P){log d/d)n, then, by means of a simple coupling construction, we can construct two independent 
sets with intersection size z for all z in the interval [{1 + (3)'^ (log d/d)'^n, (1 + (3) (log d/d)n], clearly 
violating the clustering property. The additional factor l/(2v2) is an artifact of the analysis, 
and hence we believe that our result holds for all /3 G (0, 1]. Namely, no local algorithm is capable 
of producing independent sets with size larger than factor 1/2 of the optimal, asymptotically in 
d. We note again that this coincides with the barrier for known algorithms. It is noteworthy 
that our result is the first one where algorithmic hardness derivation relies directly on the the 
geometry of the solution space, vis-a-vis the clustering phenomena, and thus the connection 
between algorithmic hardness and clustering property is not coincidental. 

The remainder of the paper is structured as follows. We introduce some basic material and 
the HLS conjecture in the next section. In the same section we state our main theorem — 
non- validity of the conjecture (Theorem 12. 5p . We also state two secondary theorems, the first 
describing the overlap structure of independent sets in random graphs (Theorem 12. 6p - the main 
tool in the proof of our result, and the second describing overlaps that can be found if local 
algorithms work well (Theorem 12. 7p . We prove our main theorem easily from the two secondary 
theorems in Section [31 We prove Theorem 12.71 in Section HI Sections [5] and |6] are devoted to 
proofs of the theorem regarding the overlap property, for the case of Erdos-Renyi and random 
regular graph, respectively. While technically we do not need such a result for the Erdos-Renyi 
graph, it is very simple to derive and provides the roadmap for the case of the regular graphs 
(where the calculations are a bit more tedious). The Erdos-Renyi case might also be useful for 
further studies of i.i.d. factors on Erdos-Renyi graphs as opposed to random regular graphs, in 
the framework described above. 

2 Preliminaries and main result 

For convenience, we repeat here some of the notions and definitions already introduced in the 
first section. 



Basic graph terminology All graphs in this paper are understood to be simple undirected 
graphs. Given a graph G with node set V{G) and edge set E[G), a subset of nodes I C V{G) 
is an independent set if {u,v) ^ E{G) for all u,v E I. A path between nodes u and v with 
length r is a sequence of nodes -ui, . . . , Ur-i such that (m, -ui), (mi, ^2), • • • , (wr-i, "t^) £ -E(G). The 
distance between nodes u and v is the length of the shortest path between them. For every 
positive integer value r and every node u G V^(G), Bi[;{u,r) denotes the depth-r neighborhood of 
u in G. Namely, Bq{u, r) is the subgraph of G induced by nodes v with distance at most r from 
u. When G is clear from context we drop the subscript. The degree of a vertex u G V{G) is the 
number of vertices v such that {u,v) G E{G). The degree of a graph G is the maximum degree 
of a vertex of G. A graph G is d-regular if the degree of every node is d. 

Random graph preliminaries Given a positive real d, G{n, d/n) denotes the Erdos-Renyi 
graph on n nodes {1, 2, . . . , n} = [n], with edge probability d/n. Namely each of the (2) edges of 
a complete graph on n nodes belongs to E{G{n,d/n)) with probability d/n, independently for 
all edges. Given a positive integer d, Gd{n) denotes a graph chosen uniformly at random from 
the space of all d-regular graphs on n nodes. This definition is meaningful only when nd is an 
even number, which we assume from now on. Given a positive integer m, let X(n, d, m) denote 
the set of all independent sets in G{n,d/n) with cardinality m. Xd{n,m) stands for a similar 
set for the case of random regular graphs. Given integers < A; < m, let 0{n,d,ni,k) denote 
the set of pairs I, J E X(n, d, m) such that |/ fl J| = k. The definition of the set Odin, m, k) is 
similar. The sizes of the sets 0{n, d, m, k) and Od{n, m, k), and in particular whether these sets 
are empty or not, is one of our focuses. 

Denote by a{n,d) the size of a largest in cardinality independent subset of G{n,d/n), nor- 
malized by n. Namely, 

a{n, d) = n^^ maxJTn : X(n, d, m) 7^ 0}. 

oid{n) stands for the similar quantity for random regular graphs. It is known that a{n, d) and 
oid{n) have deterministic limits as n — )■ 00. 

Theorem 2.1. For every d G M_|_ there exists a{d) such that w.h.p. as n ^ 00, 

a{n, d) — )■ a{d). (1) 

Similarly, for every positive integer d there exists ad such that w.h.p. as n —t- 00 

ad{n) -)■ ad. (2) 



Furthermore 



«(rf) = ^(l-o(l)), (3) 

ad = ^^{1-0(1)), (4) 



as d ^ 00. 



The convergence ([T]) and ([2]) was established in Bayati, Gamarnik and Tetah |BGT10] . The 
hmits (|3]) and (j4]) follow from much older results by Frieze |Fri90] for the case of Erdos-Renyi 
graphs and by Frieze and Luczak |FL92] for the case of random regular graphs, which established 
these limits in the lim sup„ and lim inf „ sense. The fallout of these results is that graphs G(n, d/n) 
and G(i(n) have independent sets of size up to approximately (2 log d/d)n, when n and d are large, 
namely in the doubly asymptotic sense when we first take n to infinity and then d to infinity. 

Local graph terminology A decision function is a measurable function / = f{u, G, x) where 
G is a graph on vertex set [n] for some positive integer n, u & [n] is a vertex and x G [0, 1]^ 
is a sequence of real numbers for some N > n and returns a Boolean value {0, 1}. A decision 
function / is said to compute an independent set if for every graph G and every sequence x and 
for every pair {u,v) G E{G) it is the case that either /(m,G, x) = or /(i;,G, x) = 0, or both. 
We refer to such an / as an independence function. For an independence function /, graph G 
on vertex set [n] and x G [0, 1]-^ for N > n, we let /g(/, x) denote the independent set of G 
returned by /, i.e., /(g(/, x) = {m G [n] \ /(m,G, x) = 1}. We will assume later that X is chosen 
randomly according to some probability distribution. In this case J(q(/, x) is a randomly chosen 
independent set in G. 

We now define the notion of a "local" decision function, i.e., one whose actions depend only 
on the local structure of a graph and the local randomness. The definition is a natural one, but 
we formalize it below for completeness. Let Gi and G2 be graphs on vertex sets [rii] and [112] 
respectively. Let Ui G [rii] and U2 G [n2]. We say that n : [rii] — )■ [722] is an r-local isomorphism 
mapping Ui to M2 if tt is a graph isomorphism from BQ-^^{ui,r) to BQ^{u2,r) (so in particular it 
is a bijection from BQ-^^{ui,r) to BQ^{u2,r), and further it preserves adjacency within i?G^(Mi,r) 
and Bg^{u2, r)). For Gi, G2, ui, U2 and an r-local isomorphism vr, we say sequences x^^^ G [0, l]-^^ 
and x*-^-* G [0, 1]^^ are r-locally equivalent if for every v G -Bgi {ui, r) we have Xv = a^L^,)- Finally 
we say f{u, G, x) is an r-local function if for every pair of graphs Gi, G2, for every pair of vertices 
Ui G V(Gi) and U2 G V^(G2), for every r-local isomorphism vr mapping Ui to U2 and r-locally 
equivalent sequences x^^"* and x^'^^ we have f{ui,Gi,x^^^) = /('U25 G2,x*^^^). We often use the 
notation fr to denote an r-local function. 

Let rid^r = 1 + d ■ {{d — 1)^' — l)/{d — 2) denote the number of vertices in a rooted tree of 
degree d and depth r. We let T^^,. denote a canonical rooted tree on vertex set [ud^r] with root 
being 1. For n > rid^ri^ ^ [0,1]" and an r-local function Z^, we let /r(x) denote the quantity 
/r(l, Td,r;x). Let X be chosen according to a uniform distribution on [0, 1]". The set subset of 
nodes /Gd(n)(/r, X) is called i.i.d. factor produced by the r-local function fr. As we will see below 
the oi.{fr) — ^ ■ Ex[/r(X)] accurately captures (to within an additive o(l) factor) the density of 
an independent returned by an r-local independence function fr on Gd(ri). 

First we recall the following folklore proposition which we will also use often in this paper. 

Proposition 2.2. As n ^ 00, with probability tending to 1 almost all local neighborhoods in 
G(i(n) look like a tree. Formally, for every d, r and t, for sufficiently large n, 

Pg.W {\{u G [n] I 5G,(„)(M,r) ^ Trf,,}| > en) < e. 

This immediately implies that the expected value of the independent set /(G^(„)(/r, X) pro- 
duced by fr is a{fr)n + o{n). In fact the following concentration result holds. 



Proposition 2.3. As n -^ oo, with probability tending to 1 the independent set produced by a 
r- local function f on Gd(n) is of size a{f) -n + oln). Formally, for every d, r, e and every r-local 
function f , for sufficiently large n, 

PGdH,X6[o,ir (II^Gd(n)(/r,X)| - a{fr)n\ > eu) < e. 

Proof. The proof follows from by the fact that the variance of |/Gd(n),x | is 0{n) and its expectation 
is a{fr)n + o(n), and so the concentration follows by Chebychev's inequality. The bound on the 
variance in turn follows from the fact that for every graph G, there are at most 0{n) pairs of 
vertices u and v for which the events /(u, G, X) and /(w, G, X) are not independent for random 
X. Details omitted. D 

The Hatami-Lovasz-Szegedy Conjecture and our result We now turn to describing the 
Hatami-Lovasz-Szegedy (HLS) conjecture and our result. Recall a^ defined by ([2]). The HLS 
conjecture can be stated as follows. 

Conjecture 2.4. There exists a sequence of r-local independence functions fr,i^ ^ ^ such that 
almost surely I{fr, n) is an independent set in Gd{n) and a{fr) — )■ a^ as r —t- oo. 

Namely, the conjecture asserts the existence of a local algorithm (r-local independence func- 
tion fr) which is capable of producing independent sets in Gd(r) of cardinality close to the largest 
that exist. For such an algorithm to be efficient the function fr{u, G, x) should also be efficiently 
computable uniformly. Even setting this issue aside, we show that there is a limit on the power 
of local algorithms to find large independent sets in Gd(n) and in particular the HLS conjecture 
does not hold. Let d^ = sup^ sup j^ «(/,.), where the second supremum is taken over all r-local 
independence functions fr- 

Theorem 2.5. [Main] For every e > and all sufficiently large d, 

ad 1 1 
ad 2 2^2 

That is, for every e > and for all sufficiently large d, a largest independent set obtainable by 
r-local functions is at most ^ + -j= + e for all r. 

Thus for all large enough d there is a multiplicative gap between aa and the independence 
ratio ad- That being said, our result does not rule out that for small d, ckd in fact equals a^, thus 
leaving the HLS conjecture open in this regime. 

The two main ingredients in our proof of Theorem 12.51 both deal with the overlaps between 
independent sets in random regular graphs. Informally, our first result on the size of the overlaps 
shows that in random graphs the overlaps are not of "intermediate" size — this is formalized 
in Theorem 12.61 We then show that we can apply any r-local function /^ twice, with coupled 
randomness, to produce two independent sets of intermediate overlap where the size of the 
overlap depends on the size of the independent sets found by fr and the level of coupling. This 
is formalized in Theorem 12.71 Theorem 12.51 follows immediately by combinig the two theorems 
(and appropriate setting of parameters). 



Overlaps in random graphs We now state our main tlieorem about tlie overlap of large 
independent sets. We interpret the statement after we make the formal statement. 

Theorem 2.6. For (3 G (1/^2, 1) andO < z < a/2/32 _ i <: p andd, let s = {1 + (3)d-^ log d and 
let K(z) denote the set of integers between ' ~^'^ °^ and (. +^)^ °s ^ Then, for all large enough 
d, we have 

lim Pf UkeKiz) 0{n, d, [sn\ , fc) ^ 0) = 0, (5) 

and 

lim P(^ UkeKiz) Od{n, [sn\ , fc) ^^ 0) = 0. (6) 



n— >oo 



In other words, both in the Erdos-Renyi and in the random regular graph models, when 
(3 > 1/a/2, and d is large enough, with probability approaching unity as n — t- oo, one cannot 
find a pair of independent sets I and J with size \ns\ , such that their overlap (intersection) has 
cardinality at least ~^J °^ and at most — — ^^ °^ . 

Note that for all /3 > l/\/2, there exists z satisfying < ^ < a/2/32 _ i g^^^ g^ |-]^g theorem is 
not vacuous in this setting. Furthermore as /3 — )■ 1, z can be chosen arbitrarily close to 1 making 
the forbidden overlap region extremely broad. That is, as the size of the independent sets in 
consideration approaches the maximum possible (namely as /3 '|" 1), and as rf — > oo, we can take 
z —^ 1. In other words, with probability approaching one, two nearly largest independent sets 
either overlap almost entirely or almost do not have an intersection. This is the key result for 
establishing our hardness bounds for existence of local algorithms. 

A slightly different version of the first of these results can be found as Lemma 12 in |COEllj . 
The latter paper shows that if an independent set I is chosen uniformly at random from the set 
with size nearly (1 + l3)n\ogd/d, then with high probability (with respect to the choice of J), 
there exists an empty overlap region in the sense described above. In fact, this empty overlap 
region exists for every /3 G (0, 1), as opposed to just 1 > /3 > 1/2 + 1/(2a/2) as in our case. 
Unfortunately, this result cannot be used for our purposes, since this result does not rule out the 
existence of rare sets / for which no empty overlap exists. 

Overlapping from local algorithms Next we turn to the formalizing the notion of using a 
local function fr twice on coupled randomness to produce overlapping independent sets. 

Fix an r-local independence function fr- Given a vector X = {Xu, 1 < u < n) oi variables 
Xu G [0, 1], recall that iG^fr, X) denotes the independent set of G given by m G IgUt, X) if and 
only if friu, G, X) = 1. 

Recall that X is chosen according to the uniform distribution on [0, 1]". Namely, X^ are 
independent and uniformly distributed over [0,1]. In what follows we consider some joint dis- 
tributions on pairs of vectors (X, Y) such that marginal distributions on the vector X and Y 
are uniform on [0, 1]", though X and Y are dependent on each other. The intuition behind 
the proof of Theorem 12.51 is as follows. Note that if X = Y then /G(/r,X) = I^i^fr.Y). As 
a result the overlap /(G(/r,X) fl /^(/r, Y) between I^^fr,^) and /^(/r, Y) is a{fr)n + o{n) in 
expectation. On the other hand, if X and Y are independent, then the overlap between /^(/r j X) 
and iQ^fr, Y) is a'^{fr)n + o{n) in expectation, since the decision to pick a vertex m in / is inde- 
pendent for most vertices when X and Y are independent. (In particular, note that if the local 

8 



neighborhood around m is a tree, which according to Proposition 12.21 happens with probabihty 
approaching unity, then the two decisions are independent, and m G / with probabihty a{fr)-) 
Our main theorem shows that by couphng the variables, the overlap can be arranged to be of 
any intermediate size, to within an additive o{n) factor. In particular, if a{fr) exceeds | + r^ 

we will be able to show that the overlap can be arranged to be between the values ~^ ^ °^ ^^*^ 
( +z)n og ^ (described in Theorem 12.61 which contradicts the statement of this theorem. 

Theorem 2.7. Fix a positive integer d. For constant r, let fr{u, G, x) be an r-local independence 
function and let a = a{fr)- For every 7 G [a^,a] and e > 0, and for every sufficiently large n, 
there exists a distribution on variables (X, Y) G [0, 1]" x [0, 1]" such that 

PG,(n),(x,Y) (|/G,(n)(/r,X)n/G,(„)(/.,Y)| ^ [(7 - e)n, (7 + e)n]) <e. 



3 Proof of Theorem 12.5 



We now show how Theorems 12.61 and 12.71 immediately imply Theorem 

Proof of Theorem \2.5[ Fix an r-local function fr and let a = a{fr)- Fix < 77 < 1. We will 
prove below that for sufficiently large d we have a/ad < 1/2 + l/(2-\/2) + r]. The theorem will 
then follow. 

Let e = '' °| . By Proposition 12.31 we have that almost surely an independent set returned by 

„2 



fr on Gdin) is of size at least {a — e)n. Furthermore for every 7 G [a , a] we have, by Theorem 12. 7[ 
that Gd{n) almost surely has two independent sets / and J, with 

|/|, 1^1 > (a - (^)n and |/ n J| G [(7 - e)n, (7 + e)n]. (7) 

Finally, by Theorem 12. ![ we have that for sufficiently large d, |/|, \J\ < {2d^^logd){l + ri)n < 
Ad~^\ogdn and so a^ < d~^logd, allowing us to set 7 = d~^/\ogd. 



Now we apply Theorem 12.61 with z = ed/logd and (3 > y ^-^- (Note that for this choice we 

have z < 1 and z < a/2/3^ — 1 < (3 < 1. We will also use later the fact that for this choice we 
have (3 < l/-\/2 + z = l/-\/2 + ed~^logd.) Theorem 12.61 asserts that almost surely Gd{n) has no 
independent sets of size at least (1 + P)d^^ logdn with intersection size in [(1 — z)d~^ logdn, (1 + 
z)d~^logdn]. Since |/ H J| G [(7 — e)n, (7 + e)n] = [(1 — z)d~^\ogdn, (1 + z)d''^\ogdn], we 
conclude that min{|/|, | J|} < (1 + (3)d~^\ogdn. Combining with Equation ([7]) we get that 
{a — e)n < min{|/|, \J\} < (1 + [3)d~'^ \ogdn and so a < (1 + l3)d~^\ogd + e, which by the given 
bound on /3 yields a < (1 + l/\/2)d-^ \ogd + 2e = (1 + 1/^2 + 77)^"^ \ogd. On the other hand 
we also have a^ > (2 — r])d^^ log d. It follows that a/ ad < 1/2 + l/2-\/2 + r^ as desired. 

D 



4 Proof of Theorem 12.7 



For parameter p G [0, 1], we define the p-correlated distribution on vectors of random variables 
(X, Y) to be the following: Let X, Z be independent uniform vectors over [0,1]". Now let 
Zu = Xu with probability p and Yu with probability 1 — p independently for every u G V{G). 



Let /(m, G, x) and a be as in the theorem statement. Recall that /(x) = /(l,Trf^j.,x) is the 
decision of / on the canonical tree of degree d and depth r rooted at the vertex 1. Let 7(j9) 
be the probability that /(X) = 1 and /(Y) = 1, for p-correlated variables (X, Y). As with 
Proposition 12.31 we have the following. 

Lemma 4.1. For every d, r, e > and r -local function f , for sufficiently large n we have: 

PG,(n),(x,Y) (||/G,(n)(/,X) H /G,(n)(/, Y)| - ^{p) ■ n\ > en) < e, 

where (X, Y) are p- correlated distributions on [0,1]". 

Proof. By Proposition l2.2l we have that almost surely almost all local neighborhoods are trees and 
so for most vertices u the probability that u is chosen to be in the independent sets /(/, X) and 
/(/, Y) is 7(p). By linearity of expectations we get that E[|/(/, X) fl /(/, Y)|] = 7(p) • n + o{n). 
Again observing that most local neighborhoods are disjoint we have that the variance of |/(/, X)n 
/(/, Y)| is 0{n). We conclude, by applying the Chebychev bound, that |J(/, X) fl /(/, Y)| is 
concentrated around the expectation and the lemma follows. D 

We also note that for p = 1 and p = the quantity 7(p) follow immediately from their 
definition. 

Proposition 4.2. 7(1) = a and 7(0) = a^ . 

Now to prove Theorem 12.71 it suffices to prove that for every 7 G [a^, a] there exists a p such 
that 7(p) = 7. We show this next by showing that 7(p) is continuous. 

Lemma 4.3. For every r, 7(p) is a continuous function of p. 

Proof. Let {Wu)'u € T^,.) be random variables associated with nodes in T^,,, uniformly dis- 
tributed over [0, 1], which are independent for different u and also independent from X„ and Z^- 
We use Wu as generators for the events Y^ = X^ vs Yu = Z^- In particular, given p, set Yu = X^ 
if 'Wu < p and Yu = Zu otherwise. This process is exactly the process of setting variables Yu to 
Xu and Zu with probabilities p and 1 — p respectively, independently for all nodes u. Now fix 
any pi < p2, and let 6 < (p2 — Pi)/d^~^^. We use the notation fr{Xu,Zu,Wu,p) to denote the 
value of fr when the seed variables realization is {Wu,u G T^,,), and the threshold value p is 
used. Namely, fr{Xu, Zu, Wu,p) = fr {Xul{Wu < p} + Zul{Wu > p},ue T^,,.). Here, for ease 
of notation, the reference to the tree Td,r is dropped. Utilizing this notation we have 

7(p) = P ifriXu) = friXu, Zu, Wu,p) = 1) . 

Therefore, 

l{p2)-l{pi)=^{fr{Xu) = fr{Xu,Zu,Wu,P2) = 1) " P (/r(X„) = fr{Xu, Zu,Wu, Pi) = 1) 
= E[fr{Xu)fr{Xu, Zu, Wu,P2) ' /.(X„)/,(X„, Z„, Wu,Pl)]. 

Observe that the event Wu 4- [Pi^Pa] for all u G T^,,- implies fr{Xu, Zu, Wu,P\) = fr{Xu, Zu, Wu,P2) 
for every realization of X„ and Zu- Therefore, by the union bound and since \Tfi^r\ < d^'^^, we 
have 

17(^2) -l{Pi)\ < d''^\p2-pi). 
Since r is fixed, the continuity of 7(p) is established. D 
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We are now ready to prove Theorem 12.71 

Proof of Theorem \2.1\ Given 7 G [a^, a] by Lemma 14.31 we have that there exists a p such that 
7 = l{p)- For this choice of p, let (X, Y) be a pair of p-correlated distributions. Applying 
Lemma l4T] to this choice of p, we get that with probability at least 1 — e we have |/g^(„)(/, X) fl 
^Gd(n)(/, Y)| G [(7 - e)n, (7 + e)n] as desired. D 



5 Theorem 12.61 : Case of the Erdos-Renyi graph G(n, d/n 



In this section we prove Theorem 12.61 for the case of the random Erdos-Renyi graph. Specifically 
we show that the overlap of two independent sets of near maximum cardinality can not be of 
some intermediate sizes. 

The proof is based on a simple moment argument. We first determine the expected number of 
pairs of independent sets with a prescribed overlap size and show that this expectation converges 
to zero as n — )■ oo and in fact converges to zero exponentially fast when the overlap size falls into 
the corresponding inverval. The result then follows from Markov inequality. 

Fix positive integers k < m < n. Recall that 0{n, d, m, k) is the set of all pairs of independent 
sets of cardinality m with intersection size k in the random graph G(n, d/n). It is straightforward 
to see that 



EHOMrnM] = ,,(„ _ ,),(,„ _ „,(„ _ „„ ^ ,), ^1 - -j (8) 

Let m = \ns\, where we remind that s = (1 + f3)d^^logd is given by the statement of the 
theorem. Set k = \nx\ for any 

^ ^ ni-z)\ogd ii + z)\ogd \ 

It suffices to show that there exists 7 > such that 

limsupn""'"logE[|(9(r2,(i, [ns\, [nx\)\] < —7, (10) 

n—>oo 

for all X in the interval ([9]), as then we can use a union bound on the integer choices 

/ {l-z)\ogd {l + z)\ogd 

k e n ; ,n ; 

\ d d 

From this point on we ignore [-J notation for the ease of exposition. It should be clear that 
this does not affect the argument. From ([8]), after simplifying using Stirling's approximation 
(a! ~ (a/e)") and the fact that ln(l — y) ^ —y as y ^ 0, we have 



limsupn "*" logE[|C(n, c?, [nsj, [nxj 

n 

= xloga;""*^ + 2(s — x) log(s — x)^^ + (1 — 2s + a;)log(l — 2s + x)~^ 
-4Pl^_(._.)^) (U) 
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We further simplify this expression as 

X log x~^ + 2(s - x) log(s - x)"^ + (1 - 2s + x) log(l - 2s + x)"^ - ds^ + dx^ j^. 
We have from ([9]) that for large enough d 

x~^ < d. 
Also, for large enough d, since z < (3, then 

Finally, we use the following following asymptotics valid as (i — )■ oo: 

(1 - 2s + x) log(l - 2s + x)-i = O 0^) , (12) 

which applies since < x < s = 0{\ogd/d). Substituting the expression for s = (1 + (3)d~^ logc?, 
we obtain a bound 

n-HogE[\0{ns,nx)\] < xlogd + 2 ( ^^ ^ ^^^^^^ - xj \ogd + 0{\ogd/d) 

Writing x = (1 + z) log d/d, where according to ([9]) i varies in the interval [— z,^;], we can 
conveniently rewrite our bound as 



log^rf 



(2(1 + /3) - (1 + 13^ -{l + z) + {l + z) V2) + 0(log d/d). 



d 
Now we can force the expression to be negative for large enough d, provided that 

2(1 + /3) - (1 + /3)2 _ (1 + i) + (1 + i)2/2 < 0, 

which is equivalent to \z\ < \/2(3'^ — 1 which in turn follows from the conditions on z in the 
hypothesis of the theorem statement. 

This completes the proof of ([5]) and thus the proof of the theorem for the case of Erdos-Renyi 
graph. 



6 Theorem 12.61 : Case of the random regular graph Gd{n 



We now turn to the case of random regular graphs G„((i). We use a configuration model of 
Gdin) |Bol85] . pLR00] . which is obtained by replicating each of the n nodes of the graph d times, 
and then creating a random uniformly chosen matching connecting these dn nodes. Since nd 
is assumed to be even, such a matching exists. Then for every two nodes u,v & [n] an edge is 
created between u and v, if there exists at least one edge between any of the replicas of u and 

12 



any of the replicas of v. This step of creating edges between nodes in [n] from the matching on 
nd nodes we call projecting. It is known that, conditioned on the absence of loops and parallel 
edges, this gives a model of a random regular graph. It is also known that the probability of 
appearing of at least one loop or at least two parallel edges is bounded away from zero when d is 
bounded. Since we are only concerned with statements taking place with high probability, such 
a conditioning is irrelevant to us and thus we assume that Gd(n) is obtained simply by taking 
a random uniformly chosen matching and projecting. The configuration model is denoted by 
G(i(n), with nodes denoted by (i, r) where i = 1, 2, . . . , n and r = 1, . . . , d. Namely, (i, r) is the 
r-th replica of node i in the original graph. Given any set Ad [n], let A be the natural extension 
of A into the configuration model. Namely A = {(i, r) : i G /, r = 1, . . . , rf}. 

Recall that Odin, m, k) stands for the set of pairs of independent sets /, J in Gd(n) such that 
l-^l = l"^! = ''^ ^^d I J n J| = k. Note that there are possibly some edges between I \ J and J \I 
resulting in edges between I \ J and J \ I. Let 7l{m, k, I) C Od{n, m, k) be the set of pairs /, J 
such that the number of edges between I\J and J \ / in the configuration graph model Qdin) 
is exactly /. Here, for the ease of notation we dropped the references to d and n. Observe that 
/ is at most d{m — k) and U^^*^ TZ{m,k,l) = Od{n,m,k). In what follows we will bound the 
expected size of 7l{m, k, I) for every /, and thus the expected size of their union. 

For (/, J) G 7l{m, k, I) the number of edges between the set lUJ and its complement [n]\(/U J) 
is precisely {2m — k)d — 21, since |/ U J| = 2m — k. The same applies to the configuration model: 
the number of edges between / U J and its complement [nd] \ (/ U J) is precisely (2m — k)d — 21. 
The value of E[|7^(m, k, l)\] is then computed as follows. Let R = 2m — k and / < d{m — k). 



Lemma 6.1. 



n \ fmd — kd\ (nd — Rd 



E|,JKM)l = (,,,„_,,„_,,„_«j( , j U<i- 2/ I ""*"-'"' ^ 



X 



{nd-2Rd + 2l)\ {nd /2)\2 



nd 
2 



{nd/2 -Rd + l)\2^d/2-Rd+i (^^) I • 
Proof. The proof is based on the fact that the number of matchings on a set of m nodes (for 



nd 



even m) is — ™' m . So the term ^" ; ',.. — is precisely the inverse of the number of configuration 

'_ (m/2)!2^ M)! f J to 

graphs Gtd{n). The term (^ „_;- J|^_;j „_r) is the number of ways of selecting a pair of sets / and 
J with cardinality m each and intersection size k. Finally, 

md — kd\ [nd — Rd\ {nd — 2Rd + 2l)\ 

l\{Rd — 2/)!- 



/ ) \Rd-2l) ■' '■(nd/2-i?rf + /)!2"'^/2-«rf+' 

is the number of graphs Gd(n) such that for a given choice of sets / and J, both sets are 

independent sets, and the number of edges between I\J and J \ / is /. Here (™ 7 ) represents 
the number of choices for end points of the / edges between / \ J and J \ I', /! represents the 
number of matchings once these choices are made; ('^^2« ) represents the number of choices for 
the end points of edges connecting / U J with its complement; {Rd — 2/)! represents the number 
of matchings once these choices are made; and finally 

{nd-2Rd + 2l)\ 



(nrf/2-i?rf + /)!2"'^/2-M+i 
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represents the number of matching choices between the remaining nd — 2Rd + 21 nodes in the 
complement of / U J. D 

We write k = xn,m = sn, I = dyn, where x < s < 1. Then R = (2s — x)n and y < s — x. 
Our main goal is establishing the following analogue of ( ITOj) . 

Lemma 6.2. There exists 7 > such that 

limsupn^"*" logE[|7^([nsJ, [nxj, [?^l/J)|] < —7, (13) 

for s = (1 + f3)d^^ log(i, for all x in the interval ([^ and all < y < s — x . 

The claim ([6]) of Theorem 12.51 follows from Lemma 16.21 by an argument similar to the one for 
the Erdos-Renyi graph. The rest of this section is devoted to proving Lemma 16.21 
By Lemma 16. ![ we have 

n \ fmd — kd\ (nd — Rd 



[nd — 2Rd + 2l) 2 e -^ 

(nd-2fld+"20 Z TT^i" '^ ^\^)) 

e 2 (nd) 2 

n\ {{md-kd)\f {nd-Rd)\ 

k\{{m - k)\Y{n - R)\ {ny{{md - kd - l)\y {Rd - 2l)\{nd - 2Rd + 2/)! 

{nd-2Rd+2l) „ , , nd 

xl\{Rd - 2l)\{nd - 2Rd + 21)' ^ -e^'^-\nd)--{l + o(l)) 

n\ {{md-kd)\f {nd-Rdy. 

k\{{m - k)\y{n - R)\ n{{md - kd - l)\y {nd - 2Rd + 2l)\ 

^ (nd-2Rd+21) n,,, ., nd , , , , 

X {nd - 2Rd + 2/)^ -2 -e^'^-\ndy-{l + o(l)) 
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We now consider the logarithm of the expression above normahzed by n. Thus 

n-MogE[|7^(m,fc,/)|] 
= —X log X — 2{s — x) log(s — x) — {\ — 2s + x) log(l — 2s + x) 
+2{sd — xd) \og{sd — xd) — 2{sd — xd) — dylogdy + dy 
—2{sd — xd — dy) log{sd — xd — dy) + 2{sd — xd — dy) 
+ {d — 2ds + dx) \og{d — 2ds + dx) — {d — 2ds + dx) 
-{d- Ads + 2dx + 2dy) \og{d - Ads + 2dx + 2dy) + {d - Ads + 2dx + 2dy) 

+-{d - Ads + 2dx + 2dy) \og{d - Ads + 2dx + 2y) 

+(i(2s — X — y) — - log d 

= —X log X — 2(s — x) log(s — x) — {1 — 2s + x) log(l — 2s + x) 
+2{sd — xd) \og{sd — xd) — dy log dy 
~2{sd — xd — dy) \og{sd — xd — dy) 
+ {d — 2ds + dx) \og{d — 2ds + dx) 

— {d- Ads + 2dx + 2dy) \og{d - Ads + 2dx + 2dy) logii 

—2{sd — xd) + dy + 2{sd — xd — dy) — (rf — 2ds + dx) 
+ {d - Ads + 2dx + 2dy) + d{2s - x - y) 

The term not involving log is easily checked to be zero. Consider terms of the form \og{dA) = 
log d + log A and consider the log d terms. The corresponding multiplier is 

2{sd - xd) ~ dy - 2{sd - xd - dy) + {d - 2ds + dx) - \{d - Ads + 2dx + 2dy) - |, 

which again is found to be zero. The final expression we obtain is then 

= —X log X — 2{s — x) log(s — x) — {1 — 2s + x) log(l — 2s + x) 
+2d{s — x) log(s — x) — dylogy 
—2d{s — X — y) log(s — X — y) 
+d{l -2s + x) log(l -2s + x) 

— (1 -As + 2x + 2y) log(l -As + 2x + 2y). (14) 

We now recall that s = (1 + /3) Xogdjd and x lies in the interval (|9]). We consider now two cases. 
Specifically, we first consider the case 

(/3 + z + l)2i^<y<s-x, (15) 



and then consider the case 



N 9 log d , , 

0<i/<(/3 + ^ + l)'^. (16) 
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Assume first that ( ITSj) holds. Consider the terms containing y: 

fiy) = ~dy log y — 2d{s — x — y) log(s — x — y) — -{1 — As + 2x + 2y) log(l — 4s + 2x + 2y) 
Then 

d'^f{y) = -logy- l + 21og(s-a;-y) + 2 - log(l - As + 2x + 2y) - 1 
= — logy + 21og(s — X — y) — log(l — As + 2x + 2y). 

Now by our assumption flT5|) . we have y > {(3 + z + l)^(i~^ log^ rf implying 

-logy < -21og(/3 + z + l) + 21ogd-21oglogd 

Also 4s- 2a; -2y < 4s < Slogrf/rf = 0(log(i/(i), implying that log(l-4s + 2a; + 2y) = 0{\ogd/d). 
Finally, from ([9]) we have s — x — y<s — x = {(5 + z) logd/d, implying that log(s — x — y) < 
— log d + log log d + log(/3 + z). Combining, we obtain that 

d~^f{y) < -21og(/3 + ^ + 1) + 21ogci - 21oglogci - 21ogci + 21oglogci + 21og(/3 + z) 
+ (logd/d) 
= -2 log(/3 + z + l) + 2 log(/3 + z) + 0(log d/d). 

In particular, the derivative is negative for large enough d and thus the largest value of f{y) 
when y is in the interval flTSl) is obtained at the left end of this interval. Thus, without the loss 
of generality, we may assume from now on that the bound ( TT6l) holds. 

For convenience we start with the term (1 — 2s + x) log(l — 2s + x) in ( TT^ . Using the first 
order Taylor approximation log(l — t) = —t + o(t), and the fact s = 0(\ogd/d),x = 0(\ogd/d), 
we have 

(1 - 2s + x) log(l -2s + x) = 0{\ogd/d) 

= oi\og^d/d). 

Next we analyze the term d{l — 2s + x) log(l — 2s + x). Using the approximation 

(1 - t) log(l -t) = -t + e/2 + 0(t3), 

we obtain 

d{l - 2s + x) log(l - 2s + x) = -d{2s - x) + -(2s - xf + 0(ci(2s - x)^). 

Before we expand this term in terms of d, it will be convenient to obtain a similar expansion for 
the last term in (IT^ 



-(1 - 4s + 2x + 2y) log(l - 4s + 2x + 2y) 

= --{As - 2x - 2y) + -(4s - 2x - 2y)'^ + 0{d{As + 2x + 2yf) 

= -d{2s -x) + dy + d{2s - xf - 2d{2s - x)y + dy^ + 0{d{As + 2x + 2^)^) 
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Applying the upper bound ([IS]) we have 0{d{2s - x)^) = 0{\o^ d/d?) = oilog^ d/d), 0{d{As + 
2x + 2yY) = o(log^ d/d), and dy"^ = Oilog^ d/d^) = o(log^ d/d). Combining, we obtain 

d{l -2s + x) log(l - 2s + x) - -(1 - 4s + 2x + 2y) log(l - 4s + 2x + 2y) 
= --(2s - x)2 -dy + 2d{2s - x)y + o(log^ d/d) 

= -^{2s- x)' -dy + o(log2 d/d), (17) 

where again applying bound ( TT6l) on y we have used 

2ci(2s -x)y = U^^^\ = o(log2 d/d). 

Next it is convenient to analyze the following two terms together: 

2d{s — x) log(s — x) — 2d{s — x — y) log(s — x — y) 
= 2d{s — x) log(s — x) — 2d{s — x) log(s — x — y) + 2dy log(s — x — y) 
= 2d{s — x) log(s — x) — 2d{s — x) log(s — x) — 2d{s — x) log(l — y{s — x)^'^) 

+2dy\og{s — x) — 2dy\og{l — y{s — x)~'^)) 
= —2d{s — x) log(l — y{s — x)~'^) + 2dy\og{s — x) — 2dy\og{l — y{s — x)^'^)) 
= 2d{s - x)y{s - x)'^ + 0{dy'^{s - x)'^) 

+2dy\og{s -x) + 2dy^{s - x)~^ + 0{dy^{s - x)'^) 
= 2dy + 2dy\og{s - x) + 2dy^{s - x)'^ + 0{dy^{s - x)'^) + 0{dy%s - x)'^) 
= 2dy + 2dy\og{s-x) + o(log^ d/d) , 

where we have used the asymptotics y = 0(log^ d/d'^) implied by flT6|) in the last equality. 

We now analyze the remaining terms involving y. From flTTl) we have the term —dy. Com- 
bining with the asymptotics above and the remaining term —dylogy from flT^ we obtain 

2dy + 2dy\og{s — x) — dy — dylogy = dy + 2dy\og{s — x) — dylogy. (18) 

We compute the maximum value of this quantity in the relevant range of y given by (1161) . The 
first derivative of this expression is 

d + 2d log(s — x) — d — d log y = 2d log(s — x) — d log y 

which is positive (infinite) at y = 0. At y = ((3 + z + 1)"^ log^ d/d'^, the first derivative is 

2d log(s -x)-2d log(/3 + z + 1) - 2d log log d + 2d log d 

< 2d\og{/3 + z) + 2dloglogd - 2dlogd- 2dlog{l3 + ^ + 1) - 2rfloglogc/ + 2dlogd 

= 2d\og{(3 + z)- 2d\og{i3 + ^ + 1) 

<0, 

where the inequality relies on a; > (1 — z) log d/d implied by (jH]), which gives 

s — X < {/3 + z) log d/d. 

17 



The second derivative is —d/y which is negative since |/ > 0. Thus, the function is strictly 
concave with positive and negative derivatives at the ends of the relevant interval (fT6|l . The 
maximum is then achieved at the unique point y* where the derivative is zero, namely when 
2d\og{s — x) — d\ogy* = 0, giving 



y* = {s-xf. 



Plugging this into the right-hand side of ( 1181) we obtain 

dy*+2dy* log(s — x) — dy* logy* 

= d{s — xY + 2d(s — xY log(s — x) — d{s — xY log(s — x)^ 
= d{s — x)^. 

Summarizing, and using (ITTl) . we find that the expression in ([H]) is at most 

= — xlogx — 2(s — x) log(s — x) (2s — xY + d{s — xY + o(log^ d/d)^ 

which is precisely the expression (ITT!) we have derived for the case of Erdos-Renyi graph G(n, c/n), 
with the exception of the term (1 — 2s + x) log(l — 2s + x), which is o(log d/d) by ( !T2|) . We have 
obtained the expression we have analyzed for the case of graphs G{n,c/n), for which we have 
shown that the expression is negative for the specified choices of s and x for sufficiently large d. 
This completes the proof of Lemma 16.21 and of Theorem I2.6[ 
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